RECOVERY trial: Tocilizumab and COVID-19

The Randomised Evaluation of COVID-19 Therapy (RECOVERY) is a platform trial that tested several treatments for COVID-19 since the beginning of the pandemics in 2020. Preliminary results on the anti-inflammatory drug tocilizumab were published on medRxiv on February 11, 2021

The RECOVERY was a large, open-label randomized controlled trial that assessed 28-day mortality as the primary outcome in hospitalized patients with COVID-19 using a frequentist statistical approach. Secondary outcomes were: discharge from hospital within 28 days in all patients and risk of invasive mechanical ventilation (IMV)/death among those not receiving IMV at baseline.

Compared to the control group, there was a reduction in overall mortality in the tocilizumab group (rate ratio of 0.86; 95% confidence interval 0.77-0.96). Patients receiving tocilizumab were also more likely of being discharged alive within 28 days (rate ratio of 1.22; 95%CI 1.12-1.34). Lastly, among those not receiving IMV at baseline, patients allocated to tocilizumab were less likely to reach the composite endpoint of IMV or death (risk ratio 0.85; 95% CI 0.78-0.93).

These results are of great public health interest. To date, there are only a few effective treatments for COVID-19 infection, such as corticosteroids. Interestingly, in the RECOVERY trial on tocilizumab, 82% of patients in both treatment arms were using corticosteroids. Therefore, the effects mentioned above are complementary to the already established beneficial effect of corticosteroids.

Subgroup analyses

Subgroup analyses are key for clinical research. It allows one to either further explore or confirm hypotheses based on the data.

In the RECOVERY trial, the authors reported several pre-specified subgroups analyses. Two clusters of subgroups are of distinguished clinical interest: respiratory support and days since symptom onset.


For COVID-19 infection, duration of disease and use of oxygen are crucial to stratify the risk of patients. Respiratory support should be interpreted as a direct proxy for severity of disease. Patients on invasive mechanical ventilation are naturally at greater risk of death than patients on only simple oxygen. Also, previous research has shown that dyspnea most commonly develops after a median of 5-7 days since the onset of symptoms.

Here are the results of the RECOVERY trial on the outcomes and subgroups mentioned above:

28-days mortality

Adapted from the RECOVERY trial: Effect of allocation to **tocilizumab** on **28−day mortality** by baseline characteristics

Adapted from the RECOVERY trial: Effect of allocation to tocilizumab on 28−day mortality by baseline characteristics


Discharge from hospital within 28 days

Adapted from the RECOVERY trial: Effect of allocation to **tocilizumab** on **hospital discharge** by baseline characteristics

Adapted from the RECOVERY trial: Effect of allocation to tocilizumab on hospital discharge by baseline characteristics


Invasive mechanical ventilation or death

Adapted from the RECOVERY trial: Effect of allocation to **tocilizumab** on **invasive mechanical ventilation** or **death** in those **not** on **invasive mechanical ventilation at randomisation**, by baseline characteristics

Adapted from the RECOVERY trial: Effect of allocation to tocilizumab on invasive mechanical ventilation or death in those not on invasive mechanical ventilation at randomisation, by baseline characteristics



Outcomes

Death, the risk of receiving invasive mechanical ventilation, and time to hospital discharge are crucial for both the patient and society’s benefits. Overall, tocilizumab seems to be effective for these outcomes. In addition to usual care, this medication could save several lives (NNT = 25 for mortality) of hospitalized patients with COVID-19.

However, tocilizumab may have different effects in each subgroup. It has been postulated that severity of COVID-19 disease is closely related to the degree of inflammation and length of disease. Thus, one could expect different results between the subgroups mentioned above. Yet, the subgroup results of this trial are not consistent between outcomes. Due to this difference, it becomes questionable whether these results are accurate enough to aid in specifying clinical decisions to subgroups of patients.

Subgroup analyses are key for estimating heterogeneity of tocilizumab’s effect on different groups of patients. It is clinically and economically important to determine if there is sufficient discrepancy to ascertain different treatments for specific subgroups.

Limitations

Frequentist subgroup analyses pose several limitations. Their intrinsic low statistical power increases both type 2 error, leading to false-negatives results, and type 1 error risk, inflating treatment effect. If these analyses are not appropriately interpreted, it could generate misleading conclusions about the effect of tocilizumab in specific subgroups.

It is of utmost importance to define if results from a clinical trial are clinically meaningful. Of note, P-values only provide information about the data’s incompatibility to the null hypothesis (effect equals zero). It is impossible to infer whether the true effect size is large or what is the probability of the alternative hypothesis (effect different from zero) of being true based on P-values. Also, the use of a frequentist approach to subgroup analysis can lead to multiplicity issues.

Confidence intervals (CI) can improve the interpretation of results beyond P-values. However, they are also easily misinterpreted. CIs contain a range of parameter values that are more compatible with the data than are values outside the interval. A single CI does not provide any information on the true effect size. CIs can only inform about the compatibility of data from this trial to an arbitrary threshold (e.g., 95%). It does not inform what is the probability of a range of effects (eg, RR < 1, 0.9, 0.8 etc) to be true. All of this information would be of great interest to clinicians.

Thus, we believe that Bayesian analysis can clarify the interpretation of these results.

A Bayesian reanalysis

The Bayesian statistical framework relies on defining priors that capture our belief on the true effect. These, combined with new data (likelihood), generate a posterior probability (PP). PP can be described with credible intervals, which, in contrast to confidence intervals, inform the probability of an effect given the prior and likelihood. We can also perform multiple looks at the PP without worrying about multiplicity issues.

We will perform a post hoc reanalysis of the RECOVERY trial on tocilizumab.

Rationale underlying our priors

We have decided to define the parameters of our priors for each subgroup on two fronts:

  1. We will perform a systematic review of randomized controlled trials on tocilizumab and COVID-19 (PROSPERO registration number …..)
    • To create an evidence-based prior
    • This prior will not be further described here since the systematic review has not been performed yet
  2. Based on Zampieri et al. 2020, we also decided to choose four other prior beliefs:
    • Non-informative (uniform)
    • Skeptical
    • Optimistic
    • Pessimistic

The non-informative prior is equal in every subgroup. Thus, we will not report any analysis on this prior in this file.

Below, we explain the rationale underlying the parameters for our prior beliefs and belief strengths:

Belief Strength
Prior Belief Prior Mean Rationale Underlying the Mean Weak Moderate Strong
Neutral 0.00 Null effect Uniform (all values equally possible) Pr(> 0.1) & Pr(< -0.1) = 0.05 Pr(> 0.075) & Pr(< -0.075) = 0.05
Optimistic 0.05 Estimated risk difference in sample size calculation Pr(ARD > 0) = 0.30 Pr(ARD > 0) = 0.15 Pr(ARD > 0) = 0.05
Pessimistic -0.05 Mirrored from the optimistic prior Pr(ARD < 0) = 0.30 Pr(ARD < 0) = 0.15 Pr(ARD < 0) = 0.05
Note:
Adapted from Zampieri et al. 2020; Absolute Risk Difference (ARD) < 0 means tocilizuamb is better


Because we will also create an evidence-based prior, we have decided to define the parameters of our other priors based on data from similar drugs to tocilizumab on COVID-19. Tocilizumab is an anti-inflammatory drug commonly used in patients with rheumatoid arthritis. It is a monoclonal antibody to the receptor for IL-6, a pro-inflammatory cytokine.

Corticosteroid is a classic anti-inflammatory drug class with a plethora of effects on the inflammatory cascade, such as in the IL-6 pathway. Previous research has shown that dexamethasone and methylprednisolone is effective in patients with COVID-19. One of the most notable clinical trials that has shown this effect was the RECOVERY trial on dexamethasone.

RECOVERY trial: Dexamethasone and COVID-19

A total of 2104 patients were assigned to receive dexamethasone and 4321 to receive usual care. Similar to the trial on tocilizumab, the primary outcome was 28-days mortality. Overall, the pooled age-adjusted rate ratio was 0.83 (95% confidence interval, 0.75 to 0.93). Secondary outcomes were: discharge from hospital within 28 days in all patients and risk of invasive mechanical ventilation (IMV)/death among those not receiving IMV at baseline.

Here are the results from this trial:


28-days mortality

Respiratory support subgroups

Interestingly, when analyzing by respiratory support subgroups, there is a clear differential effect of dexamethasone on mortality. Patients on invasive mechanical ventilation benefit the most, while patients not on oxygen tends to get harmed by the therapy.

Adapted from the RECOVERY trial: Effect of **dexamethasone** on **28-day mortality**, according to respiratory support at randomization

Adapted from the RECOVERY trial: Effect of dexamethasone on 28-day mortality, according to respiratory support at randomization

Days since symptom onset subgroups

The time point in which patients start the treatment also plays a vital role in the effectiveness of this drug. There were ** fewer deaths** in patients treated before ≤7 days since symptom onset.

Adapted from the RECOVERY trial: Effect of allocation to **dexamethasone** on **28−day mortality** by other pre−specified baseline characteristics

Adapted from the RECOVERY trial: Effect of allocation to dexamethasone on 28−day mortality by other pre−specified baseline characteristics


Discharge from hospital within 28 days

Both respiratory support and days since symptom onset subgroups

Again, this data suggests there might be a difference between subgroups.

Adapted from the RECOVERY trial: Effect of allocation to **dexamethasone** on **discharge from hospital alive within 28 days** by other pre−specified baseline characteristics

Adapted from the RECOVERY trial: Effect of allocation to dexamethasone on discharge from hospital alive within 28 days by other pre−specified baseline characteristics


Invasive mechanical ventilation or death

Both respiratory support and days since symptom onset subgroups

Notably, there is a clear difference in the days since symptom onset subgroups.

Adapted from the RECOVERY trial: Effect of allocation to **dexamethasone** on **invasive mechanical ventilation** or **death** in those **not** on **invasive mechanical ventilation at randomisation**, by baseline characteristics

Adapted from the RECOVERY trial: Effect of allocation to dexamethasone on invasive mechanical ventilation or death in those not on invasive mechanical ventilation at randomisation, by baseline characteristics



Priors

We have decided to choose the parameters of our belief strengths in light of the data on dexamethasone

We will discuss our priors for the following outcomes in the tocilizumab trial:

  1. 28-days mortality

  2. Discharge from the hospital within 28 days

  3. Invasive mechanical ventilation (IMV) or death in patients not receiving IMV at baseline

Regarding respiratory support, there are different subgroups between the dexamethasone and tocilizumab trial. There is not a no oxygen received subgroup in the tocilizumab trial. Lastly, the oxygen only subgroup in the dexamethasone trial is equal to both no ventilator support and non-invasive ventilation in the tocilizumab trial.

Thus, we ignored the no oxygen received subgroup data and interpreted the oxygen only as analog to both tocilizumab subgroups mentioned above.

We will report the absolute risk difference (ARD) of tocilizumab minus the control group for each prior. Regarding the mortality outcome, an ARD lower than zero means tocilizumab is better.

28-days mortality


Respiratory support

There are three subgroups:

  1. No ventilator support

  2. Non-invasive ventilation

  3. Invasive mechanical ventilation

We have decided that the parameters of priors should be the same for all three subgroups.

Priors: all subgroups

Probability of Treatment Effect
Prior Belief Assumed Mean ARD Belief Strength Assumed SD of ARD Pr(< 0) Pr(< -0.05) Pr(< -0.10)
Skeptical 0.00 Moderate 0.05 0.50 0.16 0.02
Optimistic -0.05 Moderate 0.05 0.85 0.50 0.15
Pessimistic 0.05 Weak 0.10 0.30 0.00 0.00

Corresponding dexamethasone data

Adapted from the RECOVERY trial: Effect of **dexamethasone** on **28-day mortality**, according to respiratory support at randomization

Adapted from the RECOVERY trial: Effect of dexamethasone on 28-day mortality, according to respiratory support at randomization



Days since symptom onset

There are two subgroups:

  1. ≤ 7 days
  2. > 7 days

In this case, the parameters of priors will be different for each subgroup, as follows:

Priors: ≤ 7 days

Probability of Treatment Effect
Prior Belief Assumed Mean ARD Belief Strength Assumed SD of ARD Pr(< 0) Pr(< -0.05) Pr(< -0.10)
Skeptical 0.00 Strong 0.04 0.5 0.10 0.00
Optimistic -0.05 Weak 0.10 0.7 0.50 0.30
Pessimistic 0.05 Weak 0.10 0.3 0.15 0.06

Priors: > 7 days

Probability of Treatment Effect
Prior Belief Assumed Mean ARD Belief Strength Assumed SD of ARD Pr(< 0) Pr(< -0.05) Pr(< -0.10)
Skeptical 0.00 Moderate 0.05 0.50 0.16 0.02
Optimistic -0.05 Moderate 0.05 0.85 0.50 0.15
Pessimistic 0.05 Weak 0.10 0.30 0.00 0.06

Corresponding dexamethasone data

Adapted from the RECOVERY trial: Effect of allocation to **dexamethasone** on **28−day mortality** by other pre−specified baseline characteristics

Adapted from the RECOVERY trial: Effect of allocation to dexamethasone on 28−day mortality by other pre−specified baseline characteristics



Discharge from hospital within 28 days

Now, we will discuss the priors for the outcome of discharge from the hospital within 28 days.

Respiratory support

We decided that the priors should match the ones we used for these subgroups regarding the mortality outcome. Of note, for the discharge from hospital outcome, an absolute risk difference (ARD) greater than 0 means tocilizumab is better.

Priors: all subgroups

Probability of Treatment Effect
Prior Belief Assumed Mean ARD Belief Strength Assumed SD of ARD Pr(> 0) Pr(> 0.05) Pr(> 0.10)
Skeptical 0.00 Moderate 0.05 0.50 0.16 0.02
Optimistic 0.05 Moderate 0.05 0.85 0.50 0.15
Pessimistic -0.05 Weak 0.10 0.30 0.15 0.06

Corresponding dexamethasone data

Adapted from the RECOVERY trial: Effect of allocation to **dexamethasone** on **discharge from hospital alive within 28 days** by other pre−specified baseline characteristics

Adapted from the RECOVERY trial: Effect of allocation to dexamethasone on discharge from hospital alive within 28 days by other pre−specified baseline characteristics



Days since symptom onset

The risk difference between the ≤ 7 days and > 7 days subgroup in the dexamethasone trial is small. Yet, we judged that the priors for the tocilizumab trial should be different in each subgroup.

Priors: ≤ 7 days

Probability of Treatment Effect
Prior Belief Assumed Mean ARD Belief Strength Assumed SD of ARD Pr(> 0) Pr(> 0.05) Pr(> 0.10)
Skeptical 0.00 Moderate 0.05 0.5 0.16 0.02
Optimistic 0.05 Weak 0.10 0.7 0.50 0.30
Pessimistic -0.05 Weak 0.10 0.3 0.15 0.06

Priors: > 7 days

Probability of Treatment Effect
Prior Belief Assumed Mean ARD Belief Strength Assumed SD of ARD Pr(> 0) Pr(> 0.05) Pr(> 0.10)
Skeptical 0.00 Moderate 0.05 0.50 0.16 0.02
Optimistic 0.05 Moderate 0.05 0.85 0.50 0.15
Pessimistic -0.05 Weak 0.10 0.30 0.15 0.06

Corresponding dexamethasone data

Adapted from the RECOVERY trial: Effect of allocation to **dexamethasone** on **discharge from hospital alive within 28 days** by other pre−specified baseline characteristics

Adapted from the RECOVERY trial: Effect of allocation to dexamethasone on discharge from hospital alive within 28 days by other pre−specified baseline characteristics



Invasive mechanical ventilation (IMV) or death in patients not receiving IMV at baseline

Respiratory support

Here, we will only consider data from the oxygen only subgroup of the dexamethasone trial. This is the only dexamethasone subgroup that matches the subgroups (no ventilator support and non-invasive ventilation subgroups) from the tocilizumab trial. Of note, for this outcome, an ARD smaller than 0 means tocilizumab is better.

Priors: all subgroups

Probability of Treatment Effect
Prior Belief Assumed Mean ARD Belief Strength Assumed SD of ARD Pr(< 0) Pr(< -0.05) Pr(< -0.10)
Skeptical 0.00 Moderate 0.05 0.50 0.16 0.02
Optimistic -0.05 Moderate 0.05 0.85 0.50 0.15
Pessimistic 0.05 Weak 0.10 0.30 0.00 0.00

Corresponding dexamethasone data

Adapted from the RECOVERY trial: Effect of allocation to **dexamethasone** on **invasive mechanical ventilation** or **death** in those **not** on **invasive mechanical ventilation at randomisation**, by baseline characteristics

Adapted from the RECOVERY trial: Effect of allocation to dexamethasone on invasive mechanical ventilation or death in those not on invasive mechanical ventilation at randomisation, by baseline characteristics



Days since symptom onset

For these subgroups, we judged it would be reasonable to choose different priors’ parameters for each subgroup.

Priors: ≤ 7 days

Probability of Treatment Effect
Prior Belief Assumed Mean ARD Belief Strength Assumed SD of ARD Pr(< 0) Pr(< -0.05) Pr(< -0.10)
Skeptical 0.00 Strong 0.04 0.50 0.10 0.0
Optimistic -0.05 Weak 0.10 0.70 0.50 0.3
Pessimistic 0.05 Moderate 0.05 0.15 0.02 0.0

Priors: > 7 days

Probability of Treatment Effect
Prior Belief Assumed Mean ARD Belief Strength Assumed SD of ARD Pr(< 0) Pr(< -0.05) Pr(< -0.10)
Skeptical 0.00 Moderate 0.05 0.50 0.16 0.02
Optimistic -0.05 Moderate 0.05 0.85 0.50 0.15
Pessimistic 0.05 Weak 0.10 0.30 0.00 0.06

Corresponding dexamethasone data

Adapted from the RECOVERY trial: Effect of allocation to **dexamethasone** on **invasive mechanical ventilation** or **death** in those **not** on **invasive mechanical ventilation at randomisation**, by baseline characteristics

Adapted from the RECOVERY trial: Effect of allocation to dexamethasone on invasive mechanical ventilation or death in those not on invasive mechanical ventilation at randomisation, by baseline characteristics